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DETAILED ACTION 
Response to Arguments 

The rejections of claims under 35 USC 1 12, second paragraph and first paragraph-written 
description, are withdrawn in view of the amendment to the claims or moot in view of cancellation 
of the claims. A new rejection appears below due to the amendment. 

The rejections of claims under 35 USC 102 as anticipated by US 6,607,879 and 5,952,223 
are withdrawn in view of the amendment to the claims. Neither of the cited US Patents or the 
discloses the full coding sequence of SEQ ID NO:32 or a nucleic acid at least 99% identical thereto. 
Note there is a new rejection under 35 USC 102. 

Claim Interpretation 

For claims drawn to "An isolated nucleic acid comprising a nucleic acid sequence" of SEQ 
ID NO:32 or encoding a polypeptide of SEQ ID NO:33 (Claim 124), the specification appears to 
intend that the nucleic acid and amino acid sequences are contiguous, but the isolated nucleic acid 
and polypeptide may include additional nucleic acid or amino acid sequence, respectively, on either 
end of said sequence {e.g., p. 304, lines 19-25). With this interpretation, the splice variant of 
GenBank AB01 1 1 12.1 which encodes a polypeptide identical to SEQ ID NO:33 of the instant 
application with the exception of a 30 amino acid insert and is encoded by a nucleic acid identical to 
SEQ ID NO:32 with the exception of a 90 nucleotide insert, would not qualify as a nucleic acid 
comprising the nucleic acid sequence of SEQ ID NO:32 or encoding the polypeptide SEQ ID 
NO:33. For this reason, the GenBank reference is not being applied to claim 124. If Applicants 
feel this interpretation is in error, this should be stated in the response to this office action and will 
not preclude the next Office action from being made final since the issue was raised here. 

This interpretation does not apply to isolated nucleic acids 99 % identical (or some other % 
identity) to SEQ ID NO:33, which allows gaps/inserts (see art rejection below). 

Claim Objections 

Claims 123-125 and 129 and 130 are objected to because of the following informalities: the 
SEQ ID NO: appears in parenthesis, which appears to be an advertent typographical error 
introduced by the amendment to the claims. Appropriate correction is required. 
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Claim Rejections - 35 USC § 112 
The following is a quotation of the second paragraph of 35 U.S.C. 112: 

The specification shall conclude with one or more claims particularly pointing out and distinctly claiming the subject 
matter which the applicant regards as his invention. 

Claim 123 is rejected under 35 U.S.C. 112, second paragraph, as being indefinite for failing 
to particularly point out and distinctly claim the subject matter which applicant regards as the 
invention. 

Claim 123 is dependent on cancelled claim 1 19, so the metes and bounds of the claim are 
not clear. 

Claim Rejections - 35 USC § 112 
The following is a quotation of the first paragraph of 35 U.S.C. 112: 

The specification shall contain a written description of the invention, and of the manner and process of making and 
using it, in such full, clear, concise, and exact terms as to enable any person skilled in the art to which it pertains, or 
with which it is most nearly connected, to make and use the same and shall set forth the best mode contemplated by 
the inventor of carrying out his invention. 

Claims 123-125 and 135-138 remain rejected under 35 U.S.C. 1 12, first paragraph, because 
the specification, while being enabling for the nucleic acid of SEQ ID NO:33 or a nucleic acid 
sequence at least 99% identical to SEQ ID NO:32 wherein said nucleic acid is amplified in 
squamous cell-type lung carcinomas, does not reasonably provide enablement for a 99% identical 
nucleic acid amplified in colon tumors or for a nucleic acid encoding a polypeptide of SEQ ID 
NO:33. The specification does not enable any person skilled in the art to which it pertains, or with 
which it is most nearly connected, to use the invention commensurate in scope with these claims 
and for the reasons set forth in the previous Office action and for the following reasons directed to 
the claims as amended. 

The amendment to claim 123 introduces a functional limitation requiring said nucleic acid 
be amplified in squamous cell-type lung carcinomas (SqCCa) and colon tumors; however, while 
there is enablement for the amplification in squama cell-type lung carcinomas as indicated in the 
previous Office action on page 5, lines 15-18, with 5/1 1 SqCCa showing a ACt > 1.0, only 2/17 
colon tumors had a ACt > 1.0, which is not a significant number on which to base enablement as a 
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colon tumor marker. Also, as stated in the previous Office action on page 5, lines 18-21, "Detection 
is predicated on structural relationship of the nucleic acid probe to nucleic acids) in the tumor tissue. 
One skilled in the art would not predict that detection would have the required sensitivity if the 
probe sequence differed from SEQ ID NO:33 such as the sequence of a degenerate nucleic acid or a 
hybridizing nucleic acid." Claim 124 is drawn to a nucleic acid encoding a polypeptide of SEQ ID 
NO: 33, which because of degeneracy of the genetic code would not be enabled unless the nucleic 
acid had the sequence of SEQ ID NO:32. 

Applicants argue that the amendment to the claims provides enablement. The argument has 
been fully considered, but is not persuasive. For the reasons set forth above and in the previous 
Office action v the amendments do not make the claims listed in the rejection above due to inclusion 
of amplification in colon tumors or to inclusion of degenerate encoding nucleic acids. 

Priority 

Additionally, Applicants' argument directed to priority are persuasive in that priority 
application 60/141,037 (06/23/99, e.g., p. 161 line 20) does disclose the enabling gene amplification 
data also found in the instant application. 

Claim Rejections - 35 USC § 102 
The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the 
basis for the rejections under this section made in this Office action: 
A person shall be entitled to a patent unless - 

(a) the invention was known or used by others in this country, or patented or described in a printed publication in this or a 
foreign country, before the invention thereof by the applicant for a patent. 

Claim 123 is rejected under 35 U.S.C. 102(a) as being anticipated by GenBank AB01 1 1 12.1. 

GenBank AB01 1 1 12.1 teaches a nucleic acid at least 99% identical to SEQ ID N0:32 of the 
instant application. See attached SEQUENCE COMPARISON GenBank ABO 11112.1. 

Note that for the nucleic acid there are only 3 mismatches over 3504 nucleotides with the 
admittance of a gap. This leads to a >99% identity. The following definitions percent identity are 
from the specification: 
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"Percent (%) nucleic acid sequence identity" with respect to PRO-encoding nucleic acid 
sequences identified herein is defined as the percentage of nucleotides in a candidate sequence that 
are identical with the nucleotides in the PRO nucleic acid sequence of interest, after aligning the 
sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity. 
Alignment for purposes of determining." (p. 309, lines 1-4) 



Claims 129-131 are allowable except for the objection to claims 129 and 130. 

Any inquiry concerning this communication or earlier communications from the examiner 
should be directed to Claire M. Kaufman, whose telephone number is (571) 272-0873. Dr. 
Kaufman can generally be reached Monday, Tuesday and Thursday from 8:30AM to 2:30PM. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's supervisor, 
Brenda Brumback, can be reached at (57 1 ) 272-096 1 . 

Any inquiry of a general nature or relating to the status of this application should be directed 
to the Group receptionist whose telephone number is (571) 272-1600. ~ 

Official papers filed by fax should be directed to (703) 872-9306. NOTE: If applicant does 
submit a paper by fax, the original signed copy should be retained by the applicant or applicant's 
representative. NO DUPLICATE COPIES SHOULD BE SUBMITTED so as to avoid the 
processing of duplicate papers in the Office. Please advise the examiner at the telephone number 
above before facsimile transmission. 

Claire M. Kaufman, Ph.D. 



Conclusion 




Unit 1646 



November 8, 2004 
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LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 
FEATURES 

source 



gene 
CDS 



AB011112 6364 bp mRNA linear PRI 05-JUL-2002 

Homo sapiens mRNA for KIAA0540 protein, partial cds . 

AB011112 

AB011112.3 GI:20521076 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Nagase,T., Ishikawa,K., Miyajima,N., Tanaka,A. , Kotani,H., 
Nomura, N. and Ohara,0. 

Prediction of the coding sequences of unidentified human genes. IX. 

The complete sequences of 10 0 new cDNA clones from brain which can 

code for large proteins in vitro 

DNA Res. 5 (1), 31-39 (1998) 

98290545 

9628581 

2 (bases 1 to 6364) 

Ohara,0., Nagase,T. and Ishikawa,K. 

Direct Submission 

Submitted ( 13 -FEB-1998 ) Osamu Ohara, Kazusa DNA Research Institute, 
Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 
2 92-0812 , Japan (E-mail : cdnainf o@kazusa . or . jp , Tel : +81-438-52-3913 , 
Fax:+81-4 38-52-3 914) 

On May 9, 2002 this sequence version replaced gi: 6635198. 
Location/Qualifiers 
1. .6364 

/organism="Homo sapiens" 
/ mo 1_ t ype = " rnRNA " 
/db_xref ="taxon: 9606" 
/clone= n pg00116" 
/sex="male" 
/ t is sue_type= "brain" 
/clone_lib="pBluescriptII SK plus" 

/note="This sequence was replaced that of hg04185 cDNA as 
a representative cDNA sequence for KIAA0540." 
1. .6364 

/gene="KIAA054 0" 
<1. .6128 
/gene="KIAA0540" 

/note="Start codon is not identified" 
/codon_start=3 
/product ="KIAA0 54 0 protein" 
/protein_id="BAA25466 . 3" 
/db_xref ="GI :20521077" 

/ trans lation= "PMELRHLLRPRPGLDSEPGGAEAGKARHAGAVIRTLSGMARHQG 
PARALRYFDLTPSMAGIMVPPVQRWPGPGFTFHAWLCLHPMDTAPTPAPTRPLQRKQL 
YSFFTSSGSGFEAFFTAAGTLWAVCTRKEYLTMSLPEVSFADSAWHCVAIVHVPGRR 
PFSQNLVHVYKDGHLVKTAPLRCPSLSEPFSSCCIGSAGYRTTTTTTGLPTPPVPATL 
AYTHPALTRSQSVPASTGLGWGSGLVAPLQEGSIDSTLAGTQDTRWGSPTSLEGELGA 
VAIFHEALQATALRTLCTLGPNETAPFKPEGELHELSTRLLLHYSPQACKNNICLDLS 
PSHGLDGRLTGHRVETWDVKDWNCVGGMGALLPLLERVAAQPKEAEAGPAETHDLVG 
PELTSGHNTQGLVLPLGKSSEERMERNAVAAFLLMLRNFLQGHMVNQESLVQCQGPAI 
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IGALLRCTPSWAMDMNVLMSAQLLMEQVAAEGSGPLLYLLYQHLLFNFHLWTLSDFAV 
RLGHIQYMSSIVREHRQKLRKKYGVQFILDALRTHYSPQRERPLAADDLRTVQTSLLG 
LAREFLWSLSADDVQVTQTMLSFLAATGDDGQILRRLQQNERLPERSRQRLRLRECG 
LQGLVACLPEGTVSPQLCQGLYKLFLGAGSGNTAGGGGSSGTLTPASQPGTPSPLDGP 
RPFPAAPGRHSSSLSNVLEDGSLPEPTISGDDTSNTSNPQQTSEEELCNLLTNVLFSV 
TWRGVEGSDEAAWRERGQVFSVLTQLGASATLVRPPDCIKRSLLEMMLESALTDIKEA 
PVGVLASLTQQALWLLRLLQDFLCAEGHGNQELWSEKLFEGVCSLLDRLGAWPHLANG 
TADLREMAQIGLRLVLGYILLEDPQLHAQAYVRLHMLLQTAVPARREEACYVLSKLEA 
ALGRVLNTSSLESATDEAGSPLAAAAAAAAAERCSWLVPLWTLLDRAYEPLGLQWGL 
PSLPPTNGSPTFFEDFQAFCATPEWRHFIDKQVQPTMSQFEMDTYAKSHDLMSGFWNA 
CYDMLMSSGQRRQWERAQSRRAFQELVLEPAQRRARLEGLRYTAVLKQQATQHSMALL 
HWGALWRQLASPCGAWALRDTPIPRWKLSSAETYSRMRLKLVPNHHFDPHLEASALRD 
NLGEVPLTPTEEASLPLAVTKEAKVSTPPELLQEDQLGEDELAELETPMEAAELDEQR 
EKLVLSAECQLVTWAWPGLLEVTTQNVYFYDGSTERVETEEGIGYDFRRPLAQLRE 
VHLRRFNLRRSALELFFIDQANYFLNFPCKVGTTPVSSPSQTPRPQPGPIPPHTQVRN 
QVYSWLLRLRPPSQGYLSSRSPQEMLRASGLTQKWVQREISNFEYLMQLNTIAGRTYN 
DLSQYPVFPWVLQDYVSPTLDLSNPAVFRDLSKPIGWNPKHAQLVREKYESFEDPAG 
TIDKFHYGTHYSNAAGVMHYLIRVEPFTSLHVQLQSGRFDCSDRQFHSVAAAWQARLE 
S PADVKEL I PEFFYFPDFLENQNGFDLGCLQLTNEKVGD WLPPWAS S PEDF I QQHRQ 
ALESEYVSAHLHEWIDLIFGYKQRGPAAEEALNVFYYCTYEGAVDLDHVTDERERKAL 
EGIISNFGQTPCQLLKEPHPTRLSAEEAAHRLARLDTNSPSIFQHLDELKAFFAEWS 
DGVPLVLALVPHRQPHS F I TQGS PDLLVTVS ASGLLGTHS WLP YDRNI SNYFS FS KDP 
TMGSHKTQRLLSGPWVPGSGVSGQALAVAPDGKLLFSGGHWDGSLRVTALPRGKLLSQ 
LSCHLDWTCLALDTCGIYLISGSRDTTCMVWRLLHQGGLSVGLAPKPVQVLYGHGAA 
VSCVAISTELDMAVSGSEDGTVIIHTVRRGQFVAALRPLGATFPGPIFHLALGSEGQI 
WQSSAWERPGAQVTYSLHLYSVNGKLRASLPLAEQPTALTVTEDFVLLGTAQCALHI 
LQLNTLLPAAPPLPMKVAIRSVAVTKERSHVLVGLEDGKLIWVAGQPSEVRSSQFAR 
KLWRS SRR I SQVSSGETEYNPTEAR " 

ORIGIN 

Query Match 96.6%; Score 2907.4; DB 9; Length 6364; 

Best Local Similarity 97.1%; Pred. No . 0 ; 

Matches 3008; Conservative 0; Mismatches 1; Indels 90; Gaps 1; 
Qy 1 ATGTCCCAGTTCGAAATGGACACGTATGCTAAGAGCCACGACCTTATGTCAGGTTTCTGG 60 

MillillillilMIIIIIMi Mill I lllllllllllllilllllill III IIIIM 

Db 3 02 7 ATGTCCCAGTTCGAAATGGACACGTATGCTAAGAGCCACGACCTTATGTCAGGTTTCTGG 3 0 8 6 

Qy 61 AATGCCTGCTATGACATGCTTATGAGCAGTGGGCAGCGGCGCCAGTGGGAGCGCGCCCAG 12 0 

MM MilM IIMi: IIIIM II MMII ll.illlll M IMII! Ill 

' Db 3 087 AATGCCTGCTATGACATGCTTATGAGCAGTGGGCAGCGGCGCCAGTGGGAGCGCGCCCAG 3146 



Qy 121 AGTCGTCGGGCCTTCCAGGAGCTGGTGCTGGAACCTGGGCAGAGGCGGGCGCGCCTGGAG 18 0 

IM MMII IIIIM IIIIM II MMM M MUM M IIIIM MM 

Db 314 7 AGTCGTCGGGCCTTCCAGGAGCTGGTGCTGGAACCTGCGCAGAGGCGGGCGCGCCTGGAG 32 06 

Qy 181 GGGCTACGCTACACGGCAGTGCTGAAGCAGCAGGCAACGCAGCACTCCATGGCCCTGCTG 24 0 

MM MMIMMMII MMM II IIIIM II IIIIM II IMIIMMI 

Db 32 0 7 GGGCTACGCTACACGGCAGTGCTGAAGCAGCAGGCAACGCAGCACTCCATGGCCCTGCTG 32 66 

Qy 241 CACTGGGGGGCGCTGTGGCGCCAGCTCGCCAGCCCATGTGGGGCCTGGGCGCTGAGGGAC 3 00 

i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3 2 67 CACTGGGGGGCGCTGTGGCGCCAGCTCGCCAGCCCATGTGGGGCCTGGGCGCTGAGGGAC 3 32 6 
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Qy 301 ACTCCCATCCCCCGCTGGAAACTGTCCAGCGCCGAGACATATTCACGCATGCGTCTGAAG 360 

I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3 32 7 ACTCCCATCCCCCGCTGGAAACTGTCCAGCGCCGAGACATATTCACGCATGCGTCTGAAG 3 3 86 

Qy 3 61 CTGGTGCCCAACCATCACTTCGACCCTCACCTGGAAGCCAGCGCTCTCCGAGACAATCTG 42 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 

Db 3 3 87 CTGGTGCCCAACCATCACTTCGACCCTCACCTGGAAGCCAGCGCTCTCCGAGACAATCTG 3446 

Qy 421 GGTGAGGTTCCCCTGACACCCACCGAGGAGGCCTCACTGCCTCTGGCAGTGACCAAAGAG 480 

llllll llllll II llllll MIIMII IIMII IIMIIIII MMMI I IIMIMI 

Db 3447 GGTGAGGTTCCCCTGACACCCACCGAGGAGGCCTCACTGCCTCTGGCAGTGACCAAAGAG 3 506 

Qy 481 GCCAAAGTGAGCACCCCACCCGAGTTGCTGCAGGAGGACCAGCTCGGCGAGGACGAGCTG 54 0 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I 
Db 35 0 7 GCCAAAGTGAGCACCCCACCCGAGTTGCTGCAGGAGGACCAGCTCGGCGAGGACGAGCTG 3 5 66 

Qy ~ 541 GCTGAGCTGGAGACCCCGATGGAGGCAGCAGAACTGGATGAGCAGCGTGAGAAGCTGGTG 600 

IMIII IIIIIIIIIIMlMMMIIM II llllll II llllll MUM 

Db 3 567 GCTGAGCTGGAGACCCCGATGGAGGCAGCAGAACTGGATGAGCAGCGTGAGAAGCTGGTG 3 62 6 

Qy 601 CTGTCGGCCGAGTGCCAGCTGGTGACGGTAGTGGCCGTGGTCCCAGGGCTGCTGGAGGTC 660 

MMM MMMMMMMMMIMM M IMMIMI MMM IIIMM < 

Db 3 62 7 CTGTCGGCCGAGTGCCAGCTGGTGACGGTAGTGGCCGTGGTCCCAGGGCTGCTGGAGGTC 3 686 

Qy 661 ACCACACAGAATGTATACTTCTACGATGGCAGCACTGAGCGCGTGGAAACCGAGGAGGGC 72 0 

MMM MMMMMMIMMMMM II MMM II Mill llllll 

Db 3 687 ACCACACAGAATGTATACTTCTACGATGGCAGCACTGAGCGCGTGGAAACCGAGGAGGGC 3 746 

Qy 721 ATCGGCTATGATTTCCGGCGCCCACTGGCCCAGCTGCGTGAGGTCCACCTGCGGCGTTTC 7 80 

llllll MMMIMMIMM IIMM M llllll II MUM llllll 

Db .3 74 7 ATCGGCTATGATTTCCGGCGCCCACTGGCCCAGCTGCGTGAGGTCCACCTGCGGCGTTTC 3 8 06 

Qy 781 AACCTGCGCCGTTCAGCACTTGAGCTCTTCTTTATCGATCAGGCCAACTACTTCCTCAAC 84 0 

MIIIIIIMIMI Mill I llllll II IIMII MIIIIMI MMMI MIMMII 

Db 3 807 AACCTGCGCCGTTCAGCACTTGAGCTCTTCTTTATCGATCAGGCCAACTACTTCCTCAAC 3 866 

( 

Qy 841 TTCCCATGCAAGGTGGGCACGACCCCAGTCTCATCTCCTAGCCAGACTCCGAGACCCCAG 900 

I Ml II MINIMI I II II I llllll II IIMII llillllll llllll Ml Ml I II 

Db 3 867 TTCCCATGCAAGGTGGGCACGACCCCAGTCTCATCTCCTAGCCAGACTCCCAGACCCCAG 3 92 6 

Qy 901 CCTGGCCCCATCCCACCCCATACCCAGGTACGGAACCAGGTGTACTCGTGGCTCCTGCGC 960 

I Ml II I IIMII M llllll llllll II IIMM 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 3 92 7 CCTGGCCCCATCCCACCCCATACCCAGGTACGGAACCAGGTGTACTCGTGGCTCCTGCGC 3 986 

Qy 961 CTACGGCCCCCCTCTCAAGGCTACCTAAGCAGCCGCTCCCCCCAGGAGATGCTGCGTGCC 102 0 

I II 1 1 II I M II I M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II I II II 1 1 II II 1 1 M II M 1 1 

Db 3 987 CTACGGCCCCCCTCTCAAGGCTACCTAAGCAGCCGCTCCCCCCAGGAGATGCTGCGTGCC 404 6 

Qy 1021 TCAGGCCTTACCCAGAAATGGGTACAGCGTGAGATATCCAACTTCGAGTACTTGATGCAA 1080 

Mill MMMMMIMI I MM IIMM II MUM II IIMII llllll 

Db 4 04 7 TCAGGCCTTACCCAGAAATGGGTACAGCGTGAGATATCCAACTTCGAGTACTTGATGCAA 4106 

Qy 1081 CTCAACACCATTGCGGGGCGGACCTACAATGACCTGTCTCAGTACCCTGTGTTCCCCTGG 1140 

II II II I MM MM Mill I llllll II llllll MUM I Mill Ml MUM 
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Db 


4107 


CTCAACACCATTGCGGGGCGGACCTACAATGACCTGTCTCAGTACCCTGTGTTCCCCTGG 


4166 


Qy 


1141 


GTCCTGCAGGACTACGTGTGCCCAACCCTGGACCTCAGCAACCCAGCCGTCTTCCGGGAC 

1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 1 i 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 

GTCCTGCAGGACTACGTGTCCCCAACCCTGGACCTCAGCAACCCAGCCGTCTTCCGGGAC 


1200 


Db 


4167 


4226 


Qy 


1201 


CTGTCTAAGCCCATCGGTGTGGTGAACCCCAAGCATGCCCAGCTCGTGAGGGAGAAGTAT 

IIIIIIMIIilli Mill III Mill llllhlllllllll IIIIIIIIIMMIIII 

CTGTCTAAGCCCATCGGTGTGGTGAACCCCAAGCATGCCCAGCTCGTGAGGGAGAAGTAT 


1260 


Db 


4227 


4286 


Qy 


1261 


GAAAGCTTTGAGGACCCAGCAGGGACCATTGACAAGTTCCACTATGGCACCCACTACTCC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 

GAAAGCTTTGAGGACCCAGCAGGGACCATTGACAAGTTCCACTATGGCACCCACTACTCC 


132 0 


Db 


4287 


4346 


Qy 


1321 


AATGCAGCAGGCGTGATGCACTACCTCATCCGCGTGGAGCCCTTCACCTCCCTGCACGTC 

IMIMIIIIIIII MMMM MIMMIMI Ml MIMMIIMIMM llllll 

AATGCAGCAGGCGTGATGCACTACCTCATCCGCGTGGAGCCCTTCACCTCCCTGCACGTC 


1380 


Db 


4347 


4406 


Qy 


1381 


CAGCTGCAAAGTGGCCGCTTTGACTGCTCCGACCGGCAGTTCCACTCGGTGGCGGCAGCC 

MMMMMMMIMM MMMM 1 IMMMMMM Mill! MMIMIMM 

CAGCTGCAAAGTGGCCGCTTTGACTGCTCCGACCGGCAGTTCCACTCGGTGGCGGCAGCC 


1440 


Db 


4407 


4466 


Qy 


1441 


TGGCAGGCACGCCTGGAGAGCCCTGCCGATGTGAAGGAGCTCATCCCGGAATTCTTCTAC 

Ml 1 1 1 1 Mill 1 II II 1 II Mill III llllll IMIMI MIMIIMM MUM 

TGGCAGGCACGCCTGGAGAGCCCTGCCGATGTGAAGGAGCTCATCCCGGAATTCTTCTAC 


1500 


Db 


4467 


4526 


Qy 


1501 


TTTCCTGACTTCCTGGAGAACCAGAACGGTTTTGACCTGGGCTGTCTCCAGCTGACCAAC 

IIIIMMIIIIIII llllllll lllllllllll III llllll II 1 1 1 1 1 M 1 M 1 1 1 1 

TTTCCTGACTTCCTGGAGAACCAGAACGGTTTTGACCTGGGCTGTCTCCAGCTGACCAAC 


1560 


Db 


^ 4527 


4586 


Qy 


1561 


GAGAAGGTAGGCGATGTGGTGCTACCCCCGTGGGCCAGCTCTCCTGAGGACTTCATCCAG 

IMIMMMMM IMM III Mill IIIIIIIIIIIIIIMIIIIMI III llllll 

GAGAAGGTAGGCGATGTGGTGCTACCCCCGTGGGCCAGCTCTCCTGAGGACTTCATCCAG 


1620 


Db 


4587 


4646 


Qy 


1621 


CAGCACCGCCAGGCTCTGGAGTCGGAGTATGTGTCTGCACACCTACACGAGTGGATCGAC 

MIMMMMM llllll 1 MIMM 1 IIMM IMIMI MIMIIMM llllll 

CAGCACCGCCAGGCTCTGGAGTCGGAGTATGTGTCTGCACACCTACACGAGTGGATCGAC 


1680 


Db 


4647 


4706 


Qy 


1681 


CTCATCTTTGGCTACAAGCAGCGGGGGCCAGCCGCCGAGGAGGCCCTCAATGTCTTCTAT 

llllll MMIIIIMIII 1 llllllll IMIMMMMM MIIMIMI llllll 

CTCATCTTTGGCTACAAGCAGCGGGGGCCAGCCGCCGAGGAGGCCCTCAATGTCTTCTAT 


1740 


Db 


4707 


4766 


Qy 


1741 


TACTGCACCTATGAGGGGGCTGTAGACCTGGACCATGTGACAGATGAGCGGGAACGGAAG 

MMIMIIIIIMI IMM III Mill IIIIIIIIIIIIIIMIIIIIII III IIMM 

TACTGCACCTATGAGGGGGCTGTAGACCTGGACCATGTGACAGATGAGCGGGAACGGAAG 


1800 


Db 


4767 


4826 


Qy 


1801 


GCTCTGGAGGGCATTATCAGCAACTTTGGGCAGACTCCCTGTCAGCTGCTGAAGGAGCCA 

MMMMIMM llllll 1 Mill III IIMM III IMIMMMIMM llllll 

GCTCTGGAGGGCATTATCAGCAACTTTGGGCAGACTCCCTGTCAGCTGCTGAAGGAGCCA 


1860 


Db 


4827 


4886 


Qy 


1861 


CATCCAACTCGGCTCTCAGCTGAGGAAGCAGCCCATCGCCTTGCACGCCTGGACACTAAC 

MIMM MMMM IMIMMMMM llllllll llllll MMMMMMIMM 

CATCCAACTCGGCTCTCAGCTGAGGAAGCAGCCCATCGCCTTGCACGCCTGGACACTAAC 


1920 


Db 


4887 


4946 
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Qy 1921 TCACCTAGCATCTTCCAGCACCTGGACGAACTCAAGGCATTCTTCGCAGA 1970 

IMM lllhll.ll I II III ill II II I llllll IIIIIIIMIMI 

Db 4 947 TCACCTAGCATCTTCCAGCACCTGGACGAACTCAAGGCATTCTTCGCAGAGGTTGTCAGT 5 006 

Qy 1971 1970 

Db 5 007 GATGGTGTACCCCTGGTGCTAGCCCTGGTCCCCCACCGGCAGCCCCACTCCTTCATCACC 5 066 

Qy 1971 GGTGACTGTGAGTGCCAGTGGGCTGCTGGGCACCCACAGC 2 010 

I I M I II I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 5 0 67 CAGGGTTCCCCAGACCTGTTGGTGACTGTGAGTGCCAGTGGGCTGCTGGGCACCCACAGC 512 6 

Qy 2 011 TGGTTGCCCTATGACCGCAACATAAGCAACTACTTCAGCTTCAGCAAAGACCCCACCATG 2 070 

IIIIMIIIlllilllilMMIMIMI llllllllllllllllllllllllllllll 

Db 512 7 TGGTTGCCCTATGACCGCAACATAAGCAACTACTTCAGCTTCAGCAAAGACCCCACCATG 5186 

Qy 2 071 GGCAGCCACAAGACGCAGCGACTGCTGAGTGGCCCGTGGGTGCCAGGCAGTGGTGTGAGT 213 0 

1 1 1 1 1 1 1 1 1 ll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 

Db 5187 GGCAGCCACAAGACGCAGCGACTGCTGAGTGGCCCGTGGGTGCCAGGCAGTGGTGTGAGT 5246 

Qy 2131 GGACAAGCACTGGCAGTGGCCCCGGATGGAAAGCTGCTATTCAGCGGTGGCCACTGGGAT 2190 

IMMIIIIMIII II Ml II I II lllllllllllllllllllllllllllll Mill II 

Db 524 7 GGACAAGCACTGGCAGTGGCCCCGGATGGAAAGCTGCTATTCAGCGGTGGCCACTGGGAT 53 06 

Qy 2191 GGCAGCCTGCGGGTGACTGCACTACCCCGTGGCAAGCTGTTGAGCCAGCTCAGGTGCCAC 2 2 5 0 

<M||| || IMII IIIMIIM MUM IMIIIIII II Ml II MM 

Db 5 3 07 GGCAGCCTGCGGGTGACTGCACTACCCCGTGGCAAGCTGTTGAGCCAGCTCAGCTGCCAC 53 66 

Qy 22 51 CTTGATGTAGTAACCTGCCTTGCACTGGACACCTGTGGCATCTACCTCATCTCAGGCTCC 2 310 

1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 

Db 53 67 CTTGATGTAGTAACCTGCCTTGCACTGGACACCTGTGGCATCTACCTCATCTCAGGCTCC 5426 

Qy 2 311 CGGGACACCACGTGCATGGTGTGGCGGCTCCTGCATCAGGGTGGTCTGTCAGTAGGCCTG 23 70 

llllllllllllilllllllliillllll lllllllllllllllllllllllllllll 

Db 542 7 CGGGACACCACGTGCATGGTGTGGCGGCTCCTGCATCAGGGTGGTCTGTCAGTAGGCCTG 54 86 

Qy 2 3 71 GCACCAAAGCCTGTGCAGGTCCTGTATGGGCATGGGGCTGCAGTGAGCTGTGTGGCCATC 243 0 

IMM II IMIIIMMMIIMIIIIII! IIIIIMM IIMM II MM 

Db 54 87 GCACCAAAGCCTGTGCAGGTCCTGTATGGGCATGGGGCTGCAGTGAGCTGTGTGGCCATC 5546 

Qy 2431 AGCACTGAACTTGACATGGCTGTGTCTGGATCTGAGGATGGAACTGTGATCATACACACT 24 90 

IMM II MMMMMMM MMMM IIMIMM MUM II IMM 

Db 5 54 7 AGCACTGAACTTGACATGGCTGTGTCTGGATCTGAGGATGGAACTGTGATCATACACACT 5606 

Qy 24 91 GTACGCCGCGGACAGTTTGTAGCGGCACTACGGCCTCTGGGTGCCACATTCCGTGGACCT 2550 

Mill IMMIMMMMMIMMIIM MMMM! I! :M M MM 

Db 5607 GTACGCCGCGGACAGTTTGTAGCGGCACTACGGCCTCTGGGTGCCACATTCCCTGGACCT 5666 

Qy 2 551 ATTTTCCACCTGGCATTGGGGTCCGAAGGCCAGATTGTGGTACAGAGCTCAGCGTGGGAA 2 610 

Mill M III II III II I II I MINI lllllllllllllllllllllllllllll 

Db 5667 ATTTTCCACCTGGCATTGGGGTCCGAAGGCCAGATTGTGGTACAGAGCTCAGCGTGGGAA 5 72 6 

Qy 2 611 CGTCCTGGGGCCCAGGTCACCTACTCCTTGCACCTGTATTCAGTCAATGGGAAGTTGCGG 2 670 

MINI Mill III II I MM I II I III II I lllllllllllllllllllllllllllll 
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Db 


5727 


CGTCCTGGGGCCCAGGTCACCTACTCCTTGCACCTGTATTCAGTCAATGGGAAGTTGCGG 


5786 


Qy 


2671 


GCTTCACTGCCCCTGGCAGAGCAGCCTACAGCCCTGACGGTGACAGAGGACTTTGTGTTG 

MIMM IMMMMIMM II MUM MMIMM; MUM II IMMM 

GCTTCACTGCCCCTGGCAGAGCAGCCTACAGCCCTGACGGTGACAGAGGACTTTGTGTTG 


2730 


Db - 


5787 


5846 


Qy 


2731 


CTGGGCACCGCCCAGTGCGCCCTGCACATCCTCCAACTAAACACACTGCTCCCGGCCGCG 

IIIIIIIMIIIIMIIIIIIIMIIIMIIIIMIIIIIIMIMIIIIIIMMMII 

CTGGGCACCGCCCAGTGCGCCCTGCACATCCTCCAACTAAACACACTGCTCCCGGCCGCG 


2790 


Db 


5847 


5906 


Qy 


2791 


CCTCCCTTGCCCATGAAGGTGGCCATCCGCAGCGTGGCCGTGACCAAGGAGCGCAGCCAC 

1 1 1 1 1 1 1 I 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CCTCCCTTGCCCATGAAGGTGGCCATCCGCAGCGTGGCCGTGACCAAGGAGCGCAGCCAC 


2850 


Db 


5907 


5966 


Qy 


2851 


GTGCTGGTGGGCCTGGAGGATGGCAAGCTCATCGTGGTGGTCGCGGGGCAGCCCTCTGAG 

lllllllllll II MINI III MINIM IIIIMrilllllllllllllllMIM II 

GTGCTGGTGGGCCTGGAGGATGGCAAGCTCATCGTGGTGGTCGCGGGGCAGCCCTCTGAG 


2910 


Db 


5967 


6026 


Qy 


2911 


GTGCGCAGCAGCCAGTTCGCGCGGAAGCTGTGGCGGTCCTCGCGGCGGATCTCCCAGGTG 

Ml lllill IIMII lllllllli IMII 1, Mill, li ,11111 MM 

GTGCGCAGCAGCCAGTTCGCGCGGAAGCTGTGGCGGTCCTCGCGGCGCATCTCCCAGGTG 


2970 


Db 


6027 


6086 


Qy 


2971 


TCCTCGGGAGAGACGGAATACAACCCTACTGAGGCGCGC 3 009 

Mill IN III III II Ml iM IMII III Mlllll II 

TCCTCGGGAGAGACGGAATACAACCCTACTGAGGCGCGC 612 5 




Db 


6087 
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Sequence 827, Application US/09023655 
Patent No. 6607879 
GENERAL INFORMATION: 

APPLICANT: Cocks, Benjamin G. 
INFORMATION FOR SEQ ID NO: 827: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1595 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
IMMEDIATE SOURCE: 

LIBRARY: LUNGAST01 



US-09- 


CLONE : 
023-655- 


873034 
-827 




Query Match 38.7%; Score 1367.2; DB 4; Length 1595; 
Best Local Similarity 93.6%; Pred. No. 4.2e-304; 

Matches 1494; Conservative 0; Mismatches 10; Indels 92; Gaps 


3; 


Qy 


1807 


GGAGAACCAGAACGGTTTTGACCTGGGCTGTCTCCAGCTGACCAACGAGAAGGTAGGCGA 

I I I It I I 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1! 1 1 II 1 1 ! 1 1 1 1 M 1 1 1 1 1 1 1 1 
M 1 1 1 II II 1 II II 1 1 M 1 M II II II II II II II II II 1 II II II M M M 1 1 1 1 II M 

GGAGAACCAGAALCjCj 1 1 1 10AL.L- 1 (j\aLiv„ IVjj 1 1 L-U-AAjtL. 1 ^/\L.L.i\rtU^/^Ui^o^ li-lOVj^oi-i 


1866 


Db 


1 


60 


Qy 


1867 


TGTGGTGCTACCCCCGTGGGCCAGCTCTCCTGAGGACTTCATCCAGCAGCACCGCCAGGC 

1 Mill 111111111111111111111 III MIMMMIIMM II Mill II MM 

TGTGGTGCTACCCCCGTGGGCCAGCTCTCCTGAGGACTTCATCCAGCAGCACCGGCAGGG 


1926 


Db 


61 


120 


Qy 


1927 


TCTGGAGTCGGAGTATGTGTCTGCACACCTACACGAGTGGATCGACCTCATCTTTGGCTA 

II II 1 1 1 1 II 1 II II M M II 1 II 1 1 1 M II M II II 1 1 II M 1 II 1 1 II 1 1 II II II 

TCTGGAGTCGGAGTATGTGTCTGCACANCTACACGAGTGGATCGACCTCATCTTTGGNTA 


1986 


Db 


121 


180 


Qy 


1987 


CAAGCAGCGGGGGCCAGCCGCCGAGGAGGCCCTCAATGTCTTCTATTACTGCACCTATGA 

IIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIMIMIMM 

TAAGCAGCGGGGGCCAGCCGCCGAGGAGGCCCTCAATGTCTTCTATTACTGCACCTATGA 


2046 


Db 


181 


240 


Qy 


2047 


GGGGGCTGTAGACCTGGACCATGTGACAGATGAGCGGGAACGGAAGGCTCTGGAGGGCAT 

II II III 1 M II II 1 1 II 1 II II 1 II 1 M 1 II II 1 1 II 1 II 1 II 1 II M II 1 II 1 M 1 1 1 

GGGGGCTGTAGACCTGGACCATGTGACAGATGAGCGGGAACGGAAGGCTCTGGAGGGCAT 


2106 


Db 


241 


300 


Qy 


2107 


TATCAGCAACTTTGGGCAGACTCCCTGTCAGCTGCTGAAGGAGCCACATCCAACTCGGCT 

M 1 II II 1 M II II II 1 1 1 1 M II M II 1 II II II II 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 M 

TATCAGCAACTTTGGGCAGACTCCCTGTCAGCTGCTGAAGGAGCCACATCCAACTCGGCT 


2166 


Db 


301 


360 


Qy 


2167 


CTCAGCTGAGGAAGCAGCCCATCGCCTTGCACGCCTGGACACTAACTCACCTAGCATCTT 

M II 1 M II 1 M 1 1 II 1 1 II 1 II 1 1 1 1 1 1 1 1 II 1 1 1 II M 1 M 1 1 1 1 M 1 M 1 1 1 1 M M 

CTCAGCTGAGGAAGCAGCCCATCGCCTTGCACGCCTGGACACTAACTCACCTAGCATCTT 


2226 


Db 


361 


420 


Qy 


2227 


CCAGCACCTGGACGAACTCAAGGCATTCTTCGCAGA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CCAGCACCTGGACGAACTCAAGGCATTCTTCGCAGAGGTTGTCAGTGATGGTGTACCCCT 


2262 


Db 


421 


480 


Qy 


2263 




2262 




Db 


481 


GGTGCTAGCCCTGGTCCCCCACCGGCAGCCCCACTCCTTCATCACCCAGGGTTCCCCAGA 


540 
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Qy 


2263 


GGTGACTGTGAGTGCCAGTGGGCTGCTGGGCACCCACAGCT - GGTTGCCCTATG 

IIIIIIIIIIMIII MINI III Ml III MINIM III MINIMUM 

CCTGTTGGTGACTGTGAGTGCCAGTGGGCTGCTGGGCACCCACAGCTGGGTTGCCCTATG 


2315 


Db 


541 


600 


Qy 


2316 


ACCGCAACATAAGCAACTACTTCAGCTTCAGCAAAGACCCCACCATGGGCAGCCACAAGA 

IIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIMIIIIIIIIIMI 

ACCGCAACATAAGCAACTACTTCAGCTTCAGCAAAGACCCCACCATGGGCAGCCACAAGA 


2375 


Db 


601 


660 


Qy 


2376 


CGCAGCGACTGCTGAGTGGCCCGTGGGTGCCAGGCAGTGGTGTGAGTGGACAAGCACTGG 

1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CGCAGCGACTGCTGAGTGGCCCGTGGGTGCCAGGCAGTGGTGTGAGTGGACAAGCACTGG 


2435 


Db 


661 


720 


Qy 


2436 


CAGTGGCCCCGGATGGAAAGCTGCTATTCAGCGGTGGCCACTGGGATGGCAGCCTGCGGG 

Mill IIIMII MINIMI MINIMI Mill III Mill III 1 1 1 1 1 1 1 1 1 1 1 1 1 

CAGTGGCCCCGGATGGAAAGCTGCTATTCAGCGGTGGCCACTGGGATGGCAGCCTGCGGG 


2495 


Db 


721 


780 


Qy 


2496 


TGACTGCACTACCCCGTGGCAAGCTGTTGAGCCAGCTCAGCTGCCACCTTGATGTAGTAA 

NNNMINNMNNNNNNNNNNNMININNNNNNIMM 

TGACTGCACTACCCCGTGGCAAGCTGTTGAGCCAGCTCAGCTGCCACCTTGATGTAGTAA 


2555 


Db 


781 


840 


Qy 


2556 


CCTGCCTTGCACTGGACACCTGTGGCATCTACCTCATCTCAGGCTCCCGGGACACCACGT 

MMIIIIIIIIMIMMII lllllllilllllll III Mill IIIIIIMIMMM 

CCTGCCTTGCACTGGACACCTGTGGCATCTACCTCATCTCAGGCTCCCGGGACACCACGT 


2615 


Db 


841 


900 


Qy 


2616 


GCATGGTGTGGCGGCTCCTGCATCAGGGTGGTCTGTCAGTAGGCCTGGCACCAAAGCCTG 

1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 II 

GCATGGTGTGGCGGCTCCTGCATCAGGGTGGTCTGTCAGTAGGCCT.GGCACCAAAGCCTG 


2675 


Db 


901 


960 


Qy 


2676 


TGCAGGTCCTGTATGGGCATGGGGCTGCAGTGAGCTGTGTGGCCATCAGCACTGAACTTG 

1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 N 1 

TGCAGGTCCTGTATGGGCATGGGGCTGCAGTGAGCTGTGTGGCCATCAGCACTGAACTTG 


2735 


Db 


961 


1020 


Qy 


2736 


ACATGGCTGTGTCTGGATCTGAGGATGGAACTGTGATCATACACACTGTACGCCGCGGAC 

INN II INN III MINI WWW III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III INN INN 

ACATGGCTGTGTCTGGATCTGAGGATGGAACTGTGATCATACACACTGTACGCCGCGGAC 


2795 


Db 


1021 


1080 


Qy 


2796 


AGTTTGTAGCGGCACTACGGCCTCTGGGTGCCACATTCCCTGGACCTATTTTCCACCTGG 

MM IMIMI 1 IMIIIM IIIMMMIIIIIIII IIIIIIIMIll Mill Mil 

AGTTTGTAGCGGCACTACGGCCTCTGGGTGCCACATTCCCTGGACCTATTTTCCACCTGG 


2855 


Db 


1081 


1140 


Qy 


jo 


rUTTdrtClCiTCrCi A annrr AnATTGTGGTACAGAGCTCAGCGTGGGAACGTCCTGGGGCCC 

1 1 1 1 1 1 1 1 1 II N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 

CATTGGGGTCCGAATGCCAGATTGTGGTACAGAGCTCAGCGTGGGAACGTCCTGGGGCCC 


2915 


Db 


1141 


1200 


Qy 


2916 


AGGTCACCTACTCCTTGCACCTGTATTCAGTCAATGGGAAGTTGCGGGCTTCACTGCCCC 

IMIIIIIIIIIMMIMII MIMI III IIMIMI MMIMIMIMIMIMM 

AGGTCACCTACTCCTTGCACCTGTATTCAGTCAATGGGAAGTTGCGGGCTTCACTGCCCC 


2975 


Db 


1201 


1260 


Qy 


2976 


TGGCAGAGCAGCCTACAGCCCTGACGGTGACAGAGGACTTTGTGTTGCTGGGCACCGCCC 

II 1 II II II II II II 1 1 II II II 1 1 1 II II II 1 II 1 N II II 1 1 1 1 1 1 1 II II 1 II II 1 1 
TGGCAGAGCAGCCTACAGCCCTGACGGTGACAGAGGACTTTGTGTTGCTGGGCACCGCCC 


3035 


Db 


1261 


1320 


Qy 


3036 


AGTGCGCCCTGCACATCCTCCAACTAAACACACTGCTCCCGGCCGCGCCTCCCTTGCCCA 


3095 
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Db 


1321 


MINIUM II MINIMI II INI MINI INN MINI M IMIIIIIIIIIII 

AGTGCGCCCTGCACATCCTCCAACTAAACACACTGCTCCCGGCCGCGCCTCCCTTGCCCA 


1380 


Qy 


3096 


TGAAGGTGGCCATCCGCAGCGTGGCCGTGACCAAGGAGCGCAGCCACGTGCTGGTGGGCC 

MM] III :M II M: llllll II III INI II INI II Ml 1 Mill II 

TGAAGGTGGCCATCCGCAGCGTGGCCGTGACCAAGGAGCGCA-CCACGTGCTGGTGGGCC 


3155 


Db 


1381 


1439 


Qy 


3156 


TGGAGGATGGCAAGCTCATCGTGGTGGTCGCGGGGCAGCCCTCTGAGGTGCGCAGCAGCC 

III III IMIMIIIMIMIill! II llllill II llllll lllllll III II 

TGGAGGATGGCAAGCTCATCGTGGTGGTCGCGGGGCAGCCCTCTGAGGTGCGCACAGCCC 


3215 


Db 


1440 


1499 


Qy 


3216 


AGTTCGCGCGGAAGCTGTGGCGGTCCTCGCGGCGCATCTCCCAGGTGTCCTCGGGAGAGA 


3275 


Db 


1500 


M 1 1 II 1 1 1 1 N 1 1 1 1 N 1 1 1 1 II 1 1 1 1 1 1 N 1 1 1 II 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 II 1 N 

AGTTCGCGCGGAAGCTGTGGCGGTCCTCGCGGCGCATCTCCCAGGTGTCCTCGGGAGAGA 


1559 


Qy - 


3276 


CGGAATACAACCCTACTGAGGCGCGCTGAACCTGGC 3 311 

MM llllll II llllll MUM MUM 

GGGAATACAACCCTACTGAGGCGCGCTGAACCTGGC 15 95 




Db 


1560 
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Coding only 

; Sequence 827, Application US/09023655 
; Patent No. 6607879 
; GENERAL INFORMATION: 
; APPLICANT: Cocks, Benjamin G. 
INFORMATION FOR SEQ ID NO: 82 7: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 15 95 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : single 
; TOPOLOGY: linear 

IMMEDIATE SOURCE : 

LIBRARY: LUNG AST 01 
CLONE: 873 034 
US-09-023-655-827 



Query Match 45.1%; Score 1357.2; DB 4; Length 1595; 

Best Local Similarity 93.6%; Pred. No. 6.9e-308; 

Matches 1484; Conservative 0; Mismatches 10; Indels 92; Gaps 3; 



Qy 


1515 


GG AG AACCAGAACGGTTTTGACCTGGGCTGTCTCCAGCTGACCAACGAGAAGGTAGGCGA 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

GGAGAACCAGAACGGTTTTGACCTGGGCTGTCTCCAGCTGACCAACGAGAAGGTAGGCGA 


1574 


Db 


1 


60 


Qy 


1575 


TGTGGTGCTACCCCCGTGGGCCAGCTCTCCTGAGGACTTCATCCAGCAGCACCGCCAGGC 

MMI II MUM 1 MMIMIMMMI lllllllllllll Ml illlllll MM 

TGTGGTGCTACCCCCGTGGGCCAGCTCTCCTGAGGACTTCATCCAGCAGCACCGGCAGGG 


1634 


Db 


61 


120 


Qy 


1635 


TCTGGAGTCGGAGTATGTGTCTGCACACCTACACGAGTGGATCGACCTCATCTTTGGCTA 

IIIIMI IIMM llllll II MMI MMMMMMMMMMMMMMI M 

TCTGGAGTCGGAGTATGTGTCTGCACANCTACACGAGTGGATCGACCTCATCTTTGGNTA 


1694 


Db 


121 


180 


Qy 


1695 


CAAGCAGCGGGGGCCAGCCGCCGAGGAGGCCCTCAATGTCTTCTATTACTGCACCTATGA 

M M M M M 1 M M I'M M M M 1 M M M M M M M M M M M M M M M M M 

TAAGCAGCGGGGGCCAGCCGCCGAGGAGGCCCTCAATGTCTTCTATTACTGCACCTATGA 


1754 


Db 


181 


240 


Qy 


1755 


GGGGGCTGTAGACCTGGACCATGTGACAGATGAGCGGGAACGGAAGGCTCTGGAGGGCAT 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

GGGGGCTGTAGACCTGGACCATGTGACAGATGAGCGGGAACGGAAGGCTCTGGAGGGCAT 


1814 


Db 


241 


300 


Qy 


1815 


TATCAGCAACTTTGGGCAGACTCCCTGTCAGCTGCTGAAGGAGCCACATCCAACTCGGCT 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

TATCAGCAACTTTGGGCAGACTCCCTGTCAGCTGCTGAAGGAGCCACATCCAACTCGGCT 


1874 


Db 


301 


360 


Qy 


1875 


CTCAGCTGAGGAAGCAGCCCATCGCCTTGCACGCCTGGACACTAACTCACCTAGCATCTT 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

CTCAGCTGAGGAAGCAGCCCATCGCCTTGCACGCCTGGACACTAACTCACCTAGCATCTT 


1934 


Db 


361 


420 


Qy 


1935 


CCAGCACCTGGACGAACTCAAGGCATTCTTCGCAGA 

MMMMMMMMMMMMMMMMMM 

CCAGCACCTGGACGAACTCAAGGCATTCTTCGCAGAGGTTGTCAGTGATGGTGTACCCCT 


1970 


Db 


421 


480 



Qy 



1971 



1970 
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Db 4 81 GGTGCTAGCCCTGGTCCCCCACCGGCAGCCCCACTCCTTCATCACCCAGGGTTCCCCAGA 54 0 
Qy 1971 GGTGACTGTGAGTGCCAGTGGGCTGCTGGGCACCCACAGCT - GGTTGCCCTATG 2 02 3 

MMMMMMMIMMMMMMMMMMM. MM Mill II 

Db 541 CCTGTTGGTGACTGTGAGTGCCAGTGGGCTGCTGGGCACCCACAGCTGGGTTGCCCTATG 600 

Qy 2024 ACCGCAACATAAGCAACTACTTCAGCTTCAGCAAAGACCCCACCATGGGCAGCCACAAGA 2 083 

M I II M I MIIIIIIIIMIIIIIMMIIM IMIMI Ml 

D b 601 ACCGCAACATAAGCAACTACTTCAGCTTCAGCAAAGAC.CCCACCATGGGCAGCCACAAGA 660 

Qy 2 084 CGCAGCGACTGCTGAGTGGCCCGTGGGTGCCAGGCAGTGGTGTGAGTGGACAAGCACTGG 2143 

M I II 1 1 M II 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 II I II 1 1 1 1 1 M M 1 1 1 II I II I 

Db 661 CGCAGCGACTGCTGAGTGGCCCGTGGGTGCCAGGCAGTGGTGTGAGTGGACAAGCACTGG 72 0 

Qy 2144 CAGTGGCCCCGGATGGAAAGCTGCTATTCAGCGGTGGCCACTGGGATGGCAGCCTGCGGG 22 03 

M II M II 1 1 1 1 1 1 M I II 1 1 1 1 II 1 1 II 1 1 1 1 M I II 1 1 1 1 II I M 1 1 M II II M II I 

D b 721 CAGTGGCCCCGGATGGAAAGCTGCTATTCAGCGGTGGCCACTGGGATGGCAGCCTGCGGG 78 0 

Qy 22 04 TGACTGCACTACCCCGTGGCAAGCTGTTGAGCCAGCTCAGCTGCCACCTTGATGTAGTAA 2263 

I M 1 1 1 1 1 IMMMMMMMMMMMMMMMMMMI 

Db 781 TGACTGCACTACCCCGTGGCAAGCTGTTGAGCCAGCTCAGCTGCCACCTTGATGTAGTAA 840 

Qy 2264 CCTGCCTTGCACTGGACACCTGTGGCATCTACCTCATCTCAGGCTCCCGGGACACCACGT 2 323 

II II II I II II II 1 1 II 1 1 1 1 1 1 II II I II M I II I II II 1 1 II 1 1 II 1 1 1 1 1 II II M I 

Db 841 CCTGCCTTGCACTGGACACCTGTGGCATCTACCTCATCTCAGGCTCCCGGGACACCACGT 90 0 

Qy 2 324 GCATGGTGTGGCGGCTCCTGCATCAGGGTGGTCTGTCAGTAGGCCTGGCACCAAAGCCTG 23 8 3 

M 1 1 M M 1 1 II II I II 1 1 1 1 1 1 II 1 1 II I II 1 1 1 1 1 1 II 1 1 1 II I M II 1 1 1 II I M 1 1 

D b 901 GCATGGTGTGGCGGCTCCTGCATCAGGGTGGTCTGTCAGTAGGCCTGGCACCAAAGCCTG 960 

Qy 23 84 TGCAGGTCCTGTATGGGCATGGGGCTGCAGTGAGCTGTGTGGCCATCAGCACTGAACTTG 2 443 

M II I II II II 1 1 1 1 II 1 1 II 1 1 1 1 1 II I I M I II 1 1 1 II 1 1 M I I M I M 1 1 II I M 1 1 

Db 961 TGCAGGTCCTGTATGGGCATGGGGCTGCAGTGAGCTGTGTGGCCATCAGCACTGAACTTG 102 0 

Qy 2444 ACATGGCTGTGTCTGGATCTGAGGATGGAACTGTGATCATACACACTGTACGCCGCGGAC 2 5 03 

MMMMMMMMIMM II II II M I M II M II I II I II 1 1 Mill II IMIMI 

Db 1021 ACATGGCTGTGTCTGGATCTGAGGATGGAACTGTGATCATACACACTGTACGCCGCGGAC 1080 

Qy 2504 AGTTTGTAGCGGCACTACGGCCTCTGGGTGCCACATTCCCTGGACCTATTTTCCACCTGG 2 563 

II 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 II 1 1 II II M II 1 1 1 1 M I II II 1 1 M I M 

Db 1081 AGTTTGTAGCGGCACTACGGCCTCTGGGTGCCACATTCCCTGGACCTATTTTCCACCTGG 114 0 

Qy 2 564 CATTGGGGTCCGAAGGCCAGATTGTGGTACAGAGCTCAGCGTGGGAACGTCCTGGGGCCC 2 62 3 

II II 1 1 1 M II II I IIIIIIMIIIIMIIIIIIIIIIIIIIIIIIMMIMIIIMI 

Db 1141 CATTGGGGTCCGAATGCCAGATTGTGGTACAGAGCTCAGCGTGGGAACGTCCTGGGGCCC 12 0 0 

Qy 2 62 4 AGGTCACCTACTCCTTGCACCTGTATTCAGTCAATGGGAAGTTGCGGGCTTCACTGCCCC 2 683 

I II II I M II 1 1 1 M I II 1 1 1 M M II I II 1 1 1 1 III 1 1 1 M 1 1 1 1 II II 1 1 II I II M I 

Db 12 01 AGGTCACCTACTCCTTGCACCTGTATTCAGTCAATGGGAAGTTGCGGGCTTCACTGCCCC 126 0 

Qy 2 684 TGGCAGAGCAGCCTACAGCCCTGACGGTGACAGAGGACTTTGTGTTGCTGGGCACCGCCC 2 74 3 

IMIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIMMIIIIIIIIIIMIMIIIII 

Db 12 61 TGGCAGAGCAGCCTACAGCCCTGACGGTGACAGAGGACTTTGTGTTGCTGGGCACCGCCC 132 0 
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IIIIIIIMIMIIIMIMIMIIIIIIIMIIMIIMIIIIIMIIIIIIIIIIIII 

AGTGCGCCCTGCACATCCTCCAACTAAACACACTGCTCCCGGCCGCGCCTCCCTTGCCCA 
TGAAGGTGGCCATCCGCAGCGTGGCCGTGACCAAGGAGCGCAGCCACGTGCTGGTGGGCC 

MIIIMMIIIIIIIIIIIIIIIMIIIIIIIIIIIMIII IMIIIIMIIIIIIM 

TGAAGGTGGCCATCCGCAGCGTGGCCGTGACCAAGGAGCGCA-CCACGTGCTGGTGGGCC 
TGGAGGATGGCAAGCTCATCGTGGTGGTCGCGGGGCAGCCCTCTGAGGTGCGCAGCAGCC 

1 1 1 1 1 1 1 11 1 1 1 1 1 1 It 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIIIMIIIMI I M 

TGGAGGATGGCAAGCTCATCGTGGTGGTCGCGGGGCAGCCCTCTGAGGTGCGCACAGCCC 
AGTTCGCGCGGAAGCTGTGGCGGTCCTCGCGGCGCATCTCCCAGGTGTCCTCGGGAGAGA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 M i 1 1 M 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 

AGTTCGCGCGGAAGCTGTGGCGGTCCTCGCGGCGCATCTCCCAGGTGTCCTCGGGAGAGA 
CGGAATACAACCCTACTGAGGCGCGC 3 0 09 

Mi 1 UNI i I II III II MM 
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hypothetical protein KIAA0540 - human (fragment) 
C; Species: Homo sapiens (man) 

C;Date: Ol-Feb-1999 #sequence_revision Ol-Feb-1999 #text_change ll-Jan-2002 
C; Accession: TO 02 71 

R;Nagase, T.; Ishikawa, K. ; Miyajima, M . ; Tanaka, A.; Kotani, H.; Nomura, N. ; 
Ohara, O. 

DNA Res. 5, 31-39, 1998 

A,- Title: Prediction of the coding sequences of unidentified human genes. IX. The 
complete sequences of 100 new cDNA clones from brain which can code for large 
proteins in vitro. 

A;Reference number: Z14086; MUID : 982 90545 ; PMID:9628581 
A;Accession: T00271 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;Residues: 1-1113 <NAG> 

A;Cross-ref erences : EMBL : AB011112 ; NID : dll853 82 ; PIDN : BAA25466 . 1 

A; Experimental source: brain 

C; Genetics: 

A; Note: KIAA0540 

Query Match 34.3%; Score 1802; DB 2; Length 1113; 

Best Local Similarity 100.0%; Pred. No. 4.3e-121; 

Matches 345; Conservative 0; Mismatches 0; Indels 0; Gaps 0 ; 

MSQFEMDTYAKSHDLMSGFWNACYDMLMSSGQRRQWERAQSRRAFQELVLEPAQRRARLE 6 0 

MM 1 1 Mill Mill III Ml MINI 1 1 1 1 1 Ml 1 1 1 1 1 1 1 1 MINI Mill II 

MSQFEMDTYAKSHDLMSGFWNACYDMLMSSGQRRQWERAQSRRAFQELVLEPAQRRARLE 815 

GLRYTAVLKQQATQHSMALLHWGALWRQLASPCGAWALRDTPIPRWKLSSAETYSRMRLK 12 0 

II I I I I I I I II II I M I I I I I I I I II I II I II II I I I I I I I I M I I I I I I I I I M II I M 
GLRYTAVLKQQATQHSMALLHWGALWRQLASPCGAWALRDTPIPRWKLSSAETYSRMRLK 875 

LVPNHHFDPHLEASALRDNLGEVPLTPTEEASLPLAVTKEAKVSTPPELLQEDQLGEDEL 18 0 

1 1 1 1 1 1 II I II ! 1 1 1 III III Mill 1 1 1 1 1 1 1 1 j 1 1 II 1 1 Mill III Illlllll 

LVPNHHFDPHLEASALRDNLGEVPLTPTEEASLPLAVTKEAKVSTPPELLQEDQLGEDEL 935 
AELETPMEAAELDEQREKLVLSAECQLVTWAWPGLLEVTTQNVYFYDGSTERVETEEG 24 0 

I II I M 1 1 1 1 1 M 1 1 1 1 1 M M I M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 

AELETPMEAAELDEQREKLVLSAECQLVTWAWPGLLEVTTQNVYFYDGSTERVETEEG 995 

IGYDFRRPLAQLREVHLRRFNLRRSALELFFIDQANYFLNFPCKVGTTPVSSPSQTPRPQ 3 00 

II I M I I I II I I I M I I II I I I I II I I I I I I I I I I I I I II II I I I I M I I I II I I I I I II 
IGYDFRRPLAQLREVHLRRFNLRRSALELFFIDQANYFLNFPCKVGTTPVSSPSQTPRPQ 1055 

PGPIPPHTQVRNQVYSWLLRLRPPSQGYLSSRSPQEMLRASGLTQ 345 

I I I 1 1 1 1 1 1 ! I ! ! I ! I : ! I ! ! I ! 1 1 ! 1 1 1 1 1 1 1 ! f ! ! 1 1 1 ! 1 1 ! I 
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